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Abstract. In this paper we introduce three Markovian couplings of Brownian motions on smooth Riemannian manifolds 
without boundary. The first is a shy coupling, in the sense that the distance between the processes is bounded below 
by a certain positive constant for all times, the second is a coupling in which the distance between the processes is a 
deterministic exponentially decaying functions of time, and finally, the third is a coupling in which the distance between 
the processes is constant in time, in short the fixed-distance coupling. 

The simplest nontrivial manifold is the 2-dimensional sphere in M 3 , and in this case we give the explicit construction 
of all three types of couplings mentioned above. The construction we provide at first is geometrically extrinsic. 

Next, we extend the construction of the first two couplings to manifolds of constant curvature, this time using the 
intrinsic geometry. 

Then we prove a full result which shows that an arbitrary Riemannian manifold satisfying some general conditions 
on the curvature supports shy couplings. Moreover, if in addition the Ricci curvature is non-negative, there exist fixed- 
ly distance couplings. Furthermore, if the Ricci curvature is bounded below by a positive constant, then there exists a cou- 
pling of Brownian motions for which the distance between the particles is deterministic and exponentially decaying. The 
proof is intrinsic, and relies on an extension of the notion of frames which plays an important role in the economy of the 
CO proof for even dimensional manifolds. 

As an application of the fixed-distance coupling we derive a maximum principle for the gradient of harmonic functions 
on manifolds with non-negative Ricci curvature. 

^ 1. Introduction 

The present article was initially motivated by the following (stochastic) modification of the classical Lion and 
Man problem of Rado ([12]) on manifolds. Consider a Brownian Lion X t and a Brownian Man Y t running on a 
d-dimensional Riemannian manifold M (for instance the unit sphere in M 3 ). 
(T) We consider the following two versions of the classical Lion and Man problem. 

> 

j — Problem 1 (Fast/Finite Time Coupling). Can the Lion capture the Man? 

—i More precisely, given two distinct starting points x,y e M and a Brownian motion Y t on M starting at y, can one find 

CN a Brownian motion X t on M starting at x such that r = inf {t > : X t — Y t } is almost surely finite (or almost surely 
bounded)? A weaker version of this problem is whether for a given e > and a given Brownian motion Y t on M starting at y 
one can find a Brownian motion X t on M starting at x such that r = inf {t > : d(X t ,Y t ) = e} is almost surely finite (or 
almost surely bounded). Here d(x, y) stands for the Riemannian distance on M. 

CN 

7— i Problem 2 (Strong Shy Coupling). Can the Man avoid being eaten by the Lion indefinitely? 

More precisely, given two distinct starting points x,y € M and a Brownian motion X t on M starting at x, can one find a 
Brownian motion Y t on M starting at y such that almost surely X t ^ Y t for allt>0?A stronger version of the question is 
whether the Brownian motion Y t can be chosen in such a way that there exists ane > such that almost surely d (X t , Y t ) > e 
for all t > 0. 

The notion of shy couplings of Brownian motions was introduced in [1] and subsequently studied in [2] and [10] 
and is a coupling for which, with positive probability the distance between the two processes stays positive for 
all times. A stronger form of shyness (e-shyness, e > 0) asserts that with positive probability the distance between 
the processes is greater than e. In this paper we use this latter version of shyness, in the stronger case when the 
distance between the processes is greater than e with probability 1. 

To set up the terminology, we mention that all couplings in the present paper are Markovian couplings in the 
sense of [1], as defined at the end of section Section 2. 

We note that on the unit sphere S 2 , there is an immediate affirmative answer to Problem 1: one can define X t 
as the symmetric of Y t with respect to the plane of symmetry of x and y. Since the Brownian motion Y t hits this 
plane in finite time, t is finite almost surely, so the Lion is sure to capture the Man in finite time. However, in 
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this solution of Problem 1 the capture time r is not almost surely bounded, and since the solution relies on the 
fact that the Brownian motion on S 2 is recurrent, it cannot be translated into a resolution of the corresponding 
non-deterministic problem. 

The above-mentioned coupling is known in the literature as the mirror coupling, and it was introduced by Lind- 
vall and Rogers [11] for processes defined on Euclidean spaces, and by Kendall [9] and Cranston in [6] in the case 
of processes defined on manifolds, the so-called Cranston-Kendall mirror coupling. It turns out that this coupling is 
a very useful and versatile construction when it comes to various geometric and analytic properties on manifolds. 
For instance it was shown in [9] in the case of manifolds with Ricci curvature bounded uniformly from below by 
a positive constant, one can show that the Man and the Lion must meet in finite time under this mirror coupling. 
In fact, the expected distance between the Lion and the Man decays exponentially fast. On the other hand, on 
negatively curved manifolds, mirror-coupled Brownian motions tend to get away from each other, in the sense 
that on the average the distance between the particles increases. 

In the case of the unit sphere S 2 , due to the spherical symmetry of the problem it is easy to see that the answer 
to the weaker form of Problem 2 is affirmative: one can define Y t by a convenient rotation of X t , thus ensuring 
that Y t X t for all times (since the Brownian motion on S 2 does not hit the two points where the axis of rotation 
intersects the sphere). However, the stronger form of the question does not follow from this argument, and it is 
not immediate that the answer is affirmative. In fact, at the first glance it seems that the answer is negative: if Y t is 
obtained from X t by using a rotation, then X t can get arbitrarily close to Y t at times when it is sufficiently close to 
the two points where the axis of rotation intersects the unit sphere. The glitch in this "argument" is that the latter 
set of times can be empty (by choosing convenient instantaneous rotations, we can preclude the processes from 
coming too close to the instantaneous axis of rotation). 

The first results of this paper treat the particular case of the manifold S 2 . We show in this case that we can 
construct couplings of spherical Brownian motions for which the distance between the two process is deterministic 
and decays exponentially fast to 0, respectively it increases exponentially fast to 2ir. The first is a fast approaching 
coupling while the second is a repulsive coupling (or a perverse coupling in the terminology of [10]). 

The above results lead to the natural question of whether one can construct a fixed-distance coupling, that is a 
Markovian coupling of two Brownian motions which remain at fixed-distance for all times. Since S 2 supports 
fast approaching couplings and repulsive (perverse) ones, it is feasible to expect that it should also support a 
fixed-distance coupling. 

Here is an outline of the paper. Section 2 is about notations and basic results and notions. Then, Section 3, as 
a warm up, is about the existence of fixed-distance couplings on K™. Here we show that the only fixed-distance 
coupling is the trivial one, namely the translation coupling. In fact, we show a little more, namely that there is 
no distance-decreasing coupling in K™. This is to be contrasted to the fact that on S 2 it is possible to construct 
distance-decreasing couplings. 

In Section 4 we prove the existence of the fast approaching, the repulsive, and the fixed-distance couplings on S 2 
alluded above. The construction is carried out using two ingredients: Stroock's representation of spherical Brown- 
ian motion and Kendall's characterization of co-adapted couplings of Brownian motions in Euclidean spaces (see 
[10]). From a differential geometric perspective this construction is extrinsic, in the sense that the sphere S 2 is seen 
as a submanifold of R 3 , and we take advantage of this in order to reduce the problem at hand to that of finding 
unitary matrices in M 3 satisfying certain conditions. The intriguing part about this construction is that the same 
argument does not extend to higher dimensional spheres. 

The above results raise the natural question whether this is the end of the story. Are the couplings obtained in 
the particular case of the sphere (or the Euclidean space) something accidental? What is really responsible for the 
existence of these couplings? Phrased differently, is it important that the sphere has so many nice symmetries or 
is it not? We answer this question from a differential geometric perspective, by proving that the existence of these 
couplings is in fact related to the intrinsic geometry of the sphere rather than the extrinsic one. 

We address the above questions in Section 5 in the case of manifolds of constant curvature (in any dimen- 
sion), and treat the case of distance increasing/ decreasing couplings. We switch from the extrinsic approach used 
in Section 4 to the intrinsic approach, and we prove that manifolds of constant negative curvature support shy 
couplings. In the particular case of the hyperbolic space this shows that we can construct couplings of Brown- 
ian motions which get away from each other exponentially fast (nothing very surprising after all). Interestingly 
enough, on manifolds of constant positive curvature we can construct shy couplings as well as couplings in which 
the processes get exponentially close to each other, but do not touch each other. In the particular case of spheres 
(in any dimension) we construct the exponentially decreasing /increasing couplings, which in the case of S 2 coin- 
cide with the ones from Section 4. We could have included here the construction of fixed-distance couplings on 
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manifolds of constant curvature, but technically this is very much what we do later on, so we postponed this for 
Section 6. 

The reason for including these calculations on constant curvature manifolds is that on one hand this is an 
intrinsic approach and on the other hand the computations for the distance functions is instructive, relatively 
standard and simple. 

A word about the techniques involved in Section 5 is in place here. The construction of the Brownian motions on 
manifolds uses the orthonormal frame bundle. This is a standard approach. When studying the coupling though, 
the natural tool for analyzing it is the distance function, which ultimately is related to the index forms of Jacobi 
fields along geodesies. For a reference see for example [7]. 

In Section 6 we get a very general result. This states that on an arbitrary e?-dimensional Riemannian manifold 
M with the Ricci curvature uniformly bounded from below and the sectional curvature uniformly bounded from 
above we can construct shy couplings. This existence result of shy coupling on manifolds is also stated in Kendall 
[10] without proof but with a hint on how to do it. Our approach is a bit different. Moreover, if the Ricci curvature is 
in addition non-negative, we ca also construct fixed-distance couplings. Finally, we show that if the Ricci curvature 
is actually bounded from below by a positive constant, then we can construct fast approaching couplings, for 
which the distance between processes decays exponentially fast to 0. 

A few details on the idea of the proof are in order here. In the first place we treat separately the cases when 
d is odd, respectively even. In the case of odd dimensional manifolds we can carry out the proof based on the 
orthonormal frame bundle. However, in order to carry out the construction for even dimensional manifolds we 
have to introduce the notion of TV -frames at a point x £ M which is an embedding of the tangent space T X M into 
M. N . As it turns out, it suffices to use this construction for the particular case N = d + 1, however, for the general 
TV this may be of independent interest by itself. 

Suppose we have X t and Y t our Brownian motions. From a loose point of view what we do first is to split the 
orthogonal to the tangent space at X t into orthogonal planes. This splitting is possible only if the dimension d is 
odd. If this is the case, using the parallel transport along the geodesic, we can transport these planes at X t into 
orthogonal planes at Y t . The idea of the construction is to choose the motion Y t such that it gets rotated by the 
same angle (chosen appropriately) in all these planes. This is how we construct all three couplings. In the even 
dimensional case using the d + 1-frames we essentially add one more dimension to the tangent space and carry 
out the same program. There are though several obstacles we have to overcome. 

The details of the above construction can be summarized as follows. We first show that the construction of the 
coupling can be carried out locally, and then we show that we can extend this construction to the entire manifold. 
The local construction is a first necessary step of the proof, due to the fact that we cannot choose a smooth global 
orthonormal frame. The hypothesis that the sectional curvature is bounded from above provides two key elements 
of the proof: the first is that the injectivity radius is positive, and the second is that certain index forms of geodesies 
between nearby points is positive. 

In Section 7 we discuss some geometric aspects related to the main result in the previous section (Theorem 9), 
and we present a localized version of the shy coupling, which is used in Section 8 in order to give some applications 
of the fixed-distance coupling to the maximum principle of norms of the gradient of harmonic functions and 
solutions to heat equations on manifolds with non-negative Ricci curvature. We also translate the general result of 
Theorem 9 in terms of the stochastic Lion and Man problem. 

2. Preliminaries 

We identify the vectors in M 3 with the corresponding 3x1 column matrices, and for a vector iel 3 we denote 
by x' the transpose of x. The dot product of two vectors x,y E R 3 can be written in terms of matrix multiplication 
as x ■ y — x'y. The Euclidian length of a vector x £ R 3 is ||x|| = V x'x. 

We denote by S 2 = {x e M 3 : ||a;|| = l} the unit sphere in R 3 , and for x, y € S 2 we let d(x,y) be the length of 
the geodesic joining x and y on S 2 (the length of the smaller of the two arcs of a great circle containing x and y, 

that is d (x, y) — arcsin y 1 — (x'y) 2 = 2 arcsin (\\\x — y\\)). 

There are various ways of describing the spherical Brownian motion onS 2 , that is the Brownian motion on S 2 
(see for example [3]). In what follows we exploit the Stroock's representation of spherical Brownian motion ([13]), 
as the solution X t of the ltd stochastic differential equation 

(2.1) X t =X Q + f {I - X s X' s )dB s - f X s ds, 

Jo Jo 

where B t is a 3-dimensional Brownian motion. The last term above may be thought as the pull needed in order to 
keep X t on the surface of S 2 . 
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Given two non-parallel vectors x, y € S 2 (i.e. y ^ ±x), we denote by R x , y the 3x3 rotation matrix with axis 
u = x x y (the cross product of the vectors x and y) and angle 9 E (0, tt) equal to the angle between the vectors x 
and y, so in particular R x , y u = u and R x , y x = y. It is known that R x ,y is an orthogonal matrix (R^ 1 = R') and the 
following (Rodrigues' rotation) formula holds 

(2.2) R x v = cosOI + [u] x + 1 — -u®u, 

1 + cos 

where [u] x = yx' — xy' is the cross-product matrix of u = x x y, <£> denotes the tensor product (u <E) u = uu') and / 
denotes the 3x3 identity matrix. Note that the above formula differs slightly from the usual one, due to the fact 
that we do not require the axis u to be a unit vector. 

When y = ±x, the cross product u = x x y is the zero vector, so the rotation matrix R Xt ± x is not well defined in 
this case. However, if we define R Xt ± x = ±1 we see that the R x ,± x is still a unitary matrix and satisfies R x ,± x x — 
±x. Moreover, taking the limit as 8 — > (or tt, depending on whether y = x or y = —x), we see that (2.2) still holds. 

For a given d-dimensional Riemannian manifold M, we use the standard notations from [7] or [14] to denote 
by O(M) the orthonormal frame bundle. For a given orthonormal frame U at a point x E M and £ E M. d , H^(U) 
is the horizontal lift of US, E T X M at the point U E O(M). We will use the simpler notation of Hi for H e ., with 
{&i}i=i,...,n denoting the standard basis of R d . 

We collect here some notions from differential geometry which will be used in the sequel. The reader is referred 
to [4] or [5] for basic notions and results. The curvature tensor R x at x is R X (X, Y) = V^Vy — Vy Vx — V[x,Y] 
and the Ricci tensor is the contraction Ric x (X, Y) = ^>2i = i{Rx(X, Ei)Ei,Y), where {-Ei}i=i,...,<j is any orthonormal 
basis at x and X,Y E T X M. This definition of the Ricci tensor does not depend on the choice of orthonormal basis, 
and in the particular case of surfaces it simplifies to Ric x (X, Y) = K X (X, Y), where K is the Gauss curvature. 

We denote by d(x, y) the Riemannian distance between x and y. 

A geodesic on M is a smooth curve 7 : [a, b] —> M such that j(s) = for each s e [a, b], where the dot represents 
the covariant derivative along 7. Throughout the paper we assume that the geodesies are running at unit speed. 
For a point x g M, we define C x to be the cutlocus of x, that is the set of points y G M for which there is more 
then one minimizing geodesic between x and y. We will also use the notation Cut C M x M, defined as the set 
of all points (x, y) which are at each other's cut-locus. For points x,y € M which are not at each other's cut-locus, 
we define j X; y to be the unique unit speed minimizing curve joining x and y. The injectivity radius is the smallest 
number i(M) such that any two points at distance less than i(M) can be joined by a unique minimizing geodesic. 

Given a geodesic 7, a Jacobi field along 7 is a vector field J(s) such that 

(2.3) J( S )+ J R 7(s) (J( S ) )7 ( S )) 7 (s) = 0, 

where the dot represents the derivative along 7. 

Given a vector field V along a geodesic 7 defined on [a, b], the index form I associated to it is defined as 

(2.4) I(V,V) = [ (\V(s)\ 2 ~ (Ry(s)(V(s),j(sm(s),V(s)))d S , 

J a 

and using polarization 1 can be extended to a bilinear form on the space of vector fields along the geodesic 7. In 
the particular case when J is a Jacobi field, an integration by parts formula shows that 

(2.5) 1(J, J) = (j(b), J(b)) - (j(a), J(a)) 

where [a, b] is the definition interval of 7. 

A manifold has constant curvature r if the sectional curvature is r for all choices of the two dimensional plane, 
that is (R X (X, Y)Y, X) — r for any x € M and any ortogonal unit vectors X, Y 6 T X M. In this case the Ricci 
curvature simplifies as well as the Jacobi field equation (2.3). We record here the calculation, as it will be used later 
on. Assume that j X:V is the minimal geodesic between the points x, y E M which are not at each other's cut-locus, 
p = d(x, y) and let £ E T X M and 77 E T y M be two unit vectors. Consider £(s) the extension of £ by parallel transport 
along 7 from x to y, and similarly let r)(s) be the extension of r\ by parallel transport from y to x. The Jacobi field 
J^jj whose value at x is £ and 77 at y can be computed as follows 

(2.6) J 6 , v (s) = wi(s)£(s) + w 2 (s)r)(s) 
where w±,W2 solve the boundary value problems 

u)x + ru>i =0 [ W2 + ru>2 = 

wi(0) = l and < to 2 (0) = 

wi(p) = lto 2 (p) = l 
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whose solutions are 

{ sin(VF(p-s)) / q ( sin(^rs) / q 

Next, we introduce the main notions regarding couplings. Recall that in general by a coupling we understand 
a pair of processes (X t , Y t ) defined on the same probability space, which are separately Markov, that is 

P (X s+t eA\X s =z,X u :0<u<s)=P z (X t g A) 
P{Y s+t g A\Y S = z,Y u : < u < s) = P z (Y t g A) 

for any measurable set ^4 in the state space of the processes. 

The notion of Markovian coupling as used in [1] requires that in addition to the above, the joint process (X t ,Y t ) 
is Markov and 

P {X s+ t g A\ X s = 2, X u , y u :0<u< S ) = F z (I t £4) 
(2 ' 8) P(Y, +t eA\Y s = z,X u ,Y u :0<u<s) = P z (Y t g A) 

for any measurable set ^4 in the state space of the processes. 

The notion of co-adapted coupling (introduced by Kendall, [10]) is the same as the above but without the Markov 
property of (X t ,Y t ). 

The Markovian couplings are easily obtained for instance in the case when the process (X t ,Y t ) is actually a 
diffusion on the manifold. This would be the ideal case, but we still get a Markovian coupling if we patch together 
diffusion process in a nice way. For example this will be the case of the main construction on manifolds, where 
we start the coupling following a diffusion up to a certain stopping time, then, from the point it stopped we run it 
independently according to another diffusion and then stop this at another stopping time and so on. We do this 
quietly without further details. 

3. Distance-decreasing couplings in R d 

In this section we first examine the distance-decreasing couplings in the Euclidean space R d . To be precise, we 
want to find all possible co-adapted couplings (Xt, Y t ) of d -dimensional Brownian motions, for which the distance 
\\Xt — Yt\\ is a (deterministic) non-increasing function of t > 0. 

By a result on co-adapted couplings (Lemma 6 in [10]), a co-adapted coupling (X t , Y t ) of Brownian motions in 
M. d can be represented as 

Y t =Y + f J t dX t + f K t dZt, 
Jo Jo 

where Z is a d-dimensional Brownian motion independent of X (on a possibly larger filtration), and J,K g Mdxd 
are matrix-valued predictable random processes, satisfying the identity 

(3.1) J t J' t + K t K' t = I, 

with / denoting the d x d identity matrix. 

Setting Wt — X t — Y t and using Ito's formula we obtain 

d d 
d || W t \\ 2 = TW'tdWt + d ( wl )t = 2 {X t - Y t )' {I - J t ) dX t -2{X t - Y t )' K t dZ t + £ d(W%. 

i=i i=i 

Using the independence of X and Z, and the relation 3.1 we obtain 

d 

53 d(W i ) t = tr ((7 - J t )' [I - Jt) + K' t K t ) dt 

i=i 

= tr (I - J t - J' t + J' t Jt + K' t K t ) dt 
= 2 (tr {I) — tr (J t )) dt 
= 2 (d - tr (J t )) dt. 

From the last two equations we arrive at 

d \\W t \\ 2 = 2(X t - Y t )' (I - J t ) dX t -2(X t - Y t )' K t dZ t + 2 (d - tr (J t )) dt, 
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so the differential of the quadratic variation of the martingale part of || Wt\\ 2 is given by 



((2 (X t - Y t )' (I - J t )) (2 (X t - Y t )' (I - J t ))' + (2 (X t - Y t y K t ) (2 (X t - Y t )' K t )'^j dt 

= 4 (X t - Y t )' [I-Jt- J' t + JtJ't + K t K' t ) (X t - Y t ) dt 
= A{X t -Y t )'{2I-J t -J' t )(X t -Y t )dt 
= 8(X t -Y t y(I-J t )(X t -Y t )dt, 





with equality if and only if J t = I. 

On the other hand, from (3.1) it follows that < x' J' t J t x < x'x for all x € R d , so the eigenvalues A, = Aj (t) 
of J t J' t satisfy < \ < 1, and therefore tr(J' t J t ) — ^2f = iK < d. Combining with the above we conclude 
that tr (J' t Jt) — d, and therefore J t = I for all t > 0. Equivalently this shows that dY t = dX t for all t > 0, or 
Yt = Yq — Xq + X t , and we arrive at the following. 

Theorem 3. In the Euclidean space R d , d > 1, the only co-adapted coupling ofBrownian motions with deterministic non- 
increasing distance is the translation coupling. 

As we will see later on in Theorem 7 there are distance increasing couplings on W l . 



In this section we study the couplings of Brownian motions on the unit sphere S 2 . The primary interest is the 
construction of couplings for which the distance between the processes is deterministic. Using Stroock's repre- 
sentation of the spherical Brownian motion, we construct three different couplings, as mentioned in the introduc- 
tion. In the first one the distance is decaying at an exponential rate, in the second one the distance is increasing 
to the diameter of the sphere S 2 at an exponential rate, and in the third one, which is the most interesting and 
intriguing, the distance is constant in time. 

We collect the results on the first two couplings mentioned above in the following result, and then treat sepa- 
rately the latter one. 

Theorem 4. Fix two points x, y G S 2 with y ^ ±x, and consider the spherical Brownian motion X t on S 2 given by (2.1) 
with X — x. 

a) Let Y t be the solution to 



Jo 

where R s = Rx s .y s is the rotation matrix with axis u s = X s x Y s and angle 9 S equal to the angle between X s and 
Y s (and R s = ±1 ifY s = ±X S ). Then Y t is a spherical Brownian motion on S 2 , and 



4. The 2-dimensional Sphere case, the extrinsic approach 



(4.1) 




(4.2) 



In particular, d (X t ,Y t ) = 2 arcsin 
b) Let Y t be the solution to 




(4.3) 
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where R s = R x _y is the rotation matrix with axis u s = ~X S x Y s and angle 9 S equal to the angle between X s 
and —Y s (and R s = =p/ if Y s = ±X S ). Then Y t is a spherical Brownian motion on S 2 , and 



(4.4) 



Xt - Y, 



|y + a;|| 2 e-*, t > 0. 



In particular, d yX t , Y t J = n — 2 arcsin (\ \\y + x\\ e~*/ 2 ) increases exponentially fast to tt as t — > oo. 
Notice that both (X t , Y t ) and {^X u Y^j are both Markovian couplings. In fact they are diffusions on S 2 x S 2 . 
Proof. Using (2.1), we first write 

dY t - R t dX t =R t (I- X t X' t ) dB t - R t X t dt. 

Since Rt is a unitary matrix and RtX t — Y t for all t > 0, we obtain 

dY t = {R t - RtX t X' t ) dB t - R t X t dt 

= (I- (R t X t ) {RtXt)') RtdBt - RtXtdt 
= (I - Y t Y[) R t dB t — Y t dt 
= (I-Y t Yl)dBt-Y t dt, 

where B t = f Q R s dB s is readily seen to be a 3-dimensional Brownian motion. Using again Stroock's characteriza- 
tion of spherical Brownian motion, the first claim follows. 

To prove the second claim, we apply the Ito formula to the function / (z) = z'z and to the process Z t = Y t — X t . 
We get 

3 

d\\Z t f = 2Z , t dZ t +Y / d(Z l ) t . 

i=l 

Next, we'll show that ||^*|| 2 is a process of bounded variation. To do this, we write 

(4.5) dZ t = d{Y t -X t ) 

= (R t - I) dX t 

= {R t -I) {I- XtX^ dB t - {R t - I) X t dt 

= {R t - RtXtX't -1 + X t X' t ) dB t - {Y t - X t ) dt 

= MtdBt-Z t dt, 

where M t = Rt — RtX t X[ — I + X t X[. Combining with the above, we get 

3 

(4.6) d \\Z t \\ 2 = 2Z' t M t dB t - 2Z[Z t dt + d { z ')u 

8=1 

and in order to prove the claim it suffices to show that Z' t M t = 0. Notice that 

Z' t M t = {RtX t -X t )' {Rt- RtXtX't- I + X t X' t ) 

= x tKRt - X' t R' t RtX t X' t - X' t R' t + X' t R' t X t X t 

—X[Rt + X' t RtX t X' t + X' t — X' t X t X' t 
= x t - x t - x tK + X' t R't X t X' t - x tRt + X' t RtX t X't + x't - x't 
= X' t {R t + R't){XtX' t -I). 

Using the representation in (2.2) for R t , since (M x ) = {Y t X't - X t Y()' = - [u t ] x and {u t ® «*)' = {u t u' t )' = 
u t ® u t , we obtain: 



Z' t M t = 2X t 



I (c° s °tl + 1 + l Qs6t ( X t x Y t ) {X t x Y t )'^j (XtX't - I) 



= 2 cos 9 t (X'tXtX't - X't) + 2 X' t (X t x Y t ) (X t x Y t )' (X t X' t - I) 

1 + COS dt 

= 0, 
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where in the last equality we used X' t X t = \\X t \\ 2 = 1 and X' t (X t x Y t ) = (the vector X t x Y t being orthogonal to 
X t ) . It thus follows that Z' t M t = as we claimed, and therefore || Zt \ | is a process of bounded variation, given by 

3 

(4.7) d\\Z t \\ 2 = -2\\Z t \\ 2 dt + Y,d(Z l ) t . 

i=l 

Finally note that by using (4.5) we can write the last term in the above equation as 

3 

Y,d{Z l ) t =tr{M t M' t )dt 1 

and since X t is on the unit sphere (so X[X t = 1 and (/ — X t X' t ) 2 = I — X t X' t ), we can continue with 
tx{M t M' t ) = tr((i? t -/)(/- X t X' t ) 2 (R' t - I)) = tr((i^ - I)(R t - I)(I - X t X[)) 

= tr((27 - R' t - R t )(I - X t X[)) = 2tr(/ - X t X' t ) - tr((R> + R t ){I - X t X' t )) 
( = 6 - 2X' t X t - 2tv(R t {I - X t X' t )) = 4 - 2tv{R t ) + 2tr(R t X t X[) 

= 4-2tr(Rt)+2YjX t , 

where in passing to the last line we used that RtX t — Y t . Using the fact that the trace of the rotation matrix R t 
equals the sum 1 + 2 cos 9 t of its eigenvalues (recall that by construction the angle 9 t of rotation of Rt is the angle 
between X t and Y t ), we can conclude that 

tv{M t M' t ) = 4 - 2(1 + 2Y[X t ) + 2Y[X t = 2- 2Y{X t = \\X t - Y t f = \\Z t \\ 2 : . 

Wrapping things up, we obtained 

d\\Z t \\ 2 = -\\Z t fdt, t>0. 
Setting r = inf {t > : Z t — 0}, the above can be solved as an ordinary differential equation for t < r = t (ui) 
for any path lj G £1, and we obtain the solution 

(4.9) \\Z t \\ - ||y t -X t || = Wy-xWe- 1 / 2 , t < r. 

In particular we see that for any x ^ y we have Z t ^ a.s. for all t > 0, and therefore r = oo a.s. This shows 
that 

\\Y t ^X t \\ = \\y-x\\e-^ 2 , t > 0, 
which concludes the proof of the first part of the theorem. 

To prove the second part of the theorem, note that if Y t solves (4.3), then Y t := —Y t solves (4.1) with y replaced 
by — y (the process Y t starts at — y instead of y). If Ct denotes the circle on S 2 of radius 1 and passing through X t 
and Y t (since x ^ —y, by the previous proof we have that X t ^ Y t for all t > 0, and thus Ct is well defined), it 
follows that Y t = —Y t e C t for all t > 0. The second part of the theorem follows now easily from the first part 
using simple geometric considerations. □ 

We now proceed to showing the existence of a fixed-distance coupling of Brownian motions on S 2 , that is a 
Markovian coupling (X t ,Y t ) of spherical Brownian motions for which the distance d(X t ,Y t ) is constant for all 
times t > 0. 

Assume such a coupling exists, and that X t and Y t are given by 

(4.10) dX t = (I — X t X't) dB t - X t dt and dY t = (I - Y t Y[) dW t - Y t dt, 

where B t and W t are the driving 3-dimensional Brownian motions, and X = x,Y Q = y <E S 2 . 

By a result on co-adapted couplings of free Brownian motions (assuming that the coupling is co-adapted, see 
Lemma 6 in [10]), there exist 3x3 matrices Jt and K t with 

(4.11) J t J[ + K t K't = I 
and a 3-dimensional Brownian motion C t such that 

(4.12) dW t = J t dB t +K t dC t . 

The idea is now very simple. We want to find the matrix-valued processes Jt and K t such that the distance 
between X t and Y t does not change with time. The theorem below shows that such a construction is possible, and 
that in fact the resulting coupling is not only co-adapted, but also a Markovian coupling. 

Theorem 5. For any points x,y € S 2 , there exists a fixed-distance Markovian coupling of Brownian motions on the 2- 
dimensional unit sphere S 2 starting at x and y. As it turns out, the process {X tl Y t ) is actually a diffusion on S 2 x S 2 . 
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Proof. The claim is trivial if x = ±y, so we may assume x ^ ±y. 

Denoting Z t = X t — Y t , U t = I — X t X' t and Vt = I — Y t Y( (note that U t and V t are symmetric matrices, with 
Uf = Ut and V 2 = Vt), and using the above equations we obtain 

(4.13) dZ t = U t dB t ~ V t dW t - Z t dt 

= (U t - V t Jt) dB t - VtKtdCt - Z t dt. 

Ito's formula gives 

3 

d\\Z t \\ 2 = 2Z / t dZt + Y, d ^ t 

3 

= 2 (X' t - YD (U t - V t Jt) dB t - 2 (X' t - Yj.) V t K t dC t - 2 \\X t - Y t \\ 2 dt + d{Z l ) t 

<=i 

3 

= 2 (-X' t V t Jt - Y[U t ) dB t - 2X' t V t K t dC t -2\\X t - Y t \\ 2 dt + J^ 

i=l 

3 

= 2M t dB t + 2N t dC t -2\\X t - Y t \\ 2 dt + ^ d(Z l ) t 

i=i 

with M t = -X' t V t J t - Y(U t and N t = -X[V t K t . 

The fact that B t and C t are independent Brownian motions allows us to compute the quadratic variation of 
|| Z t || 2 as follows: 

\d{\\Z\\ 2 ) t = (M t M' t +NtN' t )dt 

= ((X' t V t J t + Y' t U t ) (J[VlX t + U' t Y t ) + (X' t V t K t ) (K'^Xt)) dt 

- (X[VtJtJ[VlX t + X' t V t JtU' t Yt + YlUtJ'tVlXt + YlUtU[Y t + X' t V t K t K' t Vl X t ) dt 

= (X[V t (JtJ' t + K t K' t ) V;x t + X[V t JtU' t Yt + YlUtJ'tVlXt + Y^U t Y t ) dt 

= (X[VtVlX t + X' t VtJtU'tYt + YlUtJ'tVlXt + Y{U t Y t ) dt 

= (X' t V t Xt + X' t VtJtU t Y t + YlUtJ'tVXt + YlUtYt) dt. 

Note that X[V t X t = X[(l - F t F/)X t = X[X t - X' t Y t YlX t = 1 - c\, where c t = Y^X f , and similarly F/F t F t = 
1 — c 2 . Since X' t VtJtU t Y t is a real number, it equals its transpose which is YlU t JtV t X t . Keeping in mind that 
X[Y t = YlX t = c t , we also get 

X' t VtJtUtY t = X' t (I - Y t Yl)J t (I - X t X' t )Y t = (X' t - c t Y{)J t {Y t - c t X t ) 
= X' t J t Y t - c t X' t J t Xt - CtYlJtYt + c 2 t YlJ t Xt, 

and therefore 

-^d (||Z|| 2 ) t = 2(1 - c\ + X' t J t Y t - c t X' t J t X t - CtYlJtYt + c 2 t Yl J t X t )dt. 
If 1 1 Z t 1 1 2 is to be a constant process, then its quadratic variation must be identically zero, or 

(4.14) 1 - c\ + X[J t Yt - c t X' t J t Xt - CtYlJtYt + c 2 YlJ t X t = 0, 

and its bounded variation part must also be identically zero. To see what the latter equation is, from (4.13) we gain 

3 

-2 \\X t - Y t f dt + J2 d ( Zl )t = (-2 \\X t - Y t \\ 2 + tr ((U t - V t J t ) (U t - V t J t )' + (V t K t ) (V t K t )')) dt 

i=i 

= (-4 (1 - c t ) + tr (UtU't - UtJ[Vl - VtJtU't + V t JtJ' t Vl + V t K t K' t Vl)) dt 

- (-4 (1 - ct) + tr (U t - UtJ'tVl ~ VtJtU't + V t (J t J' t + K t K' t ) V{)) dt 

= (-4 (1 - a) + tr (U t - UtJ'X - VJtU't + WD) dt 

= (-4 (1 - c t ) + tr (I t - XtX't) + tr (I t - F t F/) - 2tr (V t J t U' t )) dt 

= (Ac t - 2tr (VtJtUt)) dt, 
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which continues with 

tr (V t J t U t ) = tr (JtUtV t ) 

= tx{J t {I-X t X' t ){I-Y t Yi)) 

= tr (J t - J t X t X' t - JtYtY; + c t J t X t Yl) 

= tr ( J t ) - tr (JtXtX' t ) - tr (J t Y t Yj) + c t tr (J t X t Yi) 

= tr {J t ) - X' t J t X t - YlJ t Y t + CtYlJtXt, 

finally arriving at 

(4.15) X' t J t X t + YlJ t Y t - CtY{J t X t = tr (J t ) - 2c t . 

The above shows that we can construct a co-adapted fixed-distance coupling of Brownian motions on S 2 iff we 
can find matrices satisfying (4.11), (4.14) and (4.15). 

We can go one step further and simplify (4.14). Indeed, because of (4.15), it is easy to see that equation (4.14) is 
equivalent to 

1 - c? + X' t J t Y t - c t (tr {J t ) - 2ct) = 0. 

Consequently, the existence of a fixed-distance co-adapted coupling of Brownian motions on S 2 is equivalent to 
solving for J t and K t from the following system 

X' t J t Y t = -c 2 t + cttr (J t ) - 1 (or X' t V t JtU t Yt = cf - 1) 
X'^tXt + YlJtYt ~ CtY^hXt = tr (J t ) - 2c t (or tr {V t J t U t ) = 2c t ) 
J t J't + K t K' t = I. 

To find a solution of this system, we are easing a little bit the notations by dropping for the moment the depen- 
dence on t. Hence, given two vectors X, Y on the unit sphere we want to find two 3x3 matrices J and K such 
that 

!X'JY = ctr(J) - 1 - c 2 
X'JX + Y'JY - cY'JX = tr(J) - 2c 
J J' + KK' = I 

where c = X'Y. The first two equations above involve only J. Assuming that we can determine J which satisfies 
these equations, from the third equation of the system we can find the matrix K such that KK' = I — J J' if and 
only if J J' < I in the operator sense, i.e. £'JJ'£ < 1 for any unit vector £ E M 3 , or equivalently ||J'£|| < 1. The 
latter condition is the same as the operator norm of J' is less than 1, or || J|| < 1, since the operator norm of J and 
J' are the same. 

Assume now that X, Y e S 2 with X ^ ±Y are fixed. We can find an orthogonal matrix Ox.y such that 

Oxyei — X and Ox,y( ce i + \A — c 2 e 2 ) = Y 
where here (e$)j=i 2,3 is the standard basis in ]R 3 , 





" 1 " 




" " 




' " 


ei = 





, e 2 = 


1 


, and 63 — 

















1 



One way of choosing such a matrix Ox,y is for example by taking 

Oxx [ei , cei + \pi - c 2 e 2 , y/l - c 2 e 3 ] = [X, F, JT x Y] , 

where [X, V, Z] denotes the matrix whose columns are the vectors X, Y, Z. It is worth mentioning that if the matrix 
Ox.y is to be orthogonal, then it has to map 63 into an unitary vector which is collinear to X x Y, which in this 
case gives Ox,Y e 3 = ±^75=1^ x Y, so there are essentially two choices for the matrix Ox,y- 
Computing the inverse 
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we obtain an explicit formula for Ox,y 
(4.17) O x ,y = 



x\ yi x 2 y3 - y 2 x 3 
x 2 2/2 £32/1 - 2/3^1 
X3 2/3 £i2/2 - 2/1^2 



1 - 





vp 







Vi 3 ^ 2 



%3 



-CXi+yx .r 2 l/;i- yi-.r.i 



Vl-c 2 


Vi- 


-c 2 


-CX2+V2 


B3V1- 




\/l-C 2 


Vi- 


-c 2 


— CX 3 +U3 


•I'l.i/2- 


-yi-j'2 


Vl-c 2 


Vi- 


-c 2 



Note that since X V= ±Y, c ^ ±1, so the matrix Ojf,y is well defined. 

Finding a solution J to the system (4.16) is equivalent to finding a solution 

J = 0' x y JO x ,y and K = y KO x ,y 

to the system obtained from (4.16) by replacing X by ei, and F by cei + yi" — c 2 e2, which becomes 

'ce'j Jei + \/i - c 2 eiJe 2 = ctr(J) - 1 - c 2 
ei Jei + c-y/l - c 2 e' 1 Je 2 + (1 - c 2 )e' 2 Je 2 = tr(J) - 2c 
JJ' + KK 1 = I. 

Now let 

tti a 2 Qf3 

J= A ft ft 

7i 72 73 

which turns the first two equations of the above system into 



(4.18) 



VT- c 2 a 2 - c/3 2 - C73 = -1 - c 2 
cv 7 ! - c 2 a 2 - c 2 ft - 73 = -2c. 



This is a system of two equations with three unknown which can be reduced to 



ft 

73 



c 

C. 



Of course the case c = needs to be treated separately in which case, it is obvious that a 2 = — 1 and 73 = 0. 

In the case c 7^ 0, the simplest matrix J which satisfies the above conditions is the one whose entries are all 
except for a 2 , ft, and 73, so we may try 



J 



a 2 

q Vl-c 2 q 2 + l 
0c 



The main restriction now is that we want the operator norm of J to be at most 1. Because of the block diagonal 
structure, this is equivalent to 



a + 



(VT^a 2 + lf 



< 1 



or 



(a a + y/l - c 2 ) 2 < 0, 
whose solution is a 2 = — \/l — c 2 , and consequently 



J = 



Vl - c 2 
c 
c 



This matrix now is well defined also for c = and is consistent with the solutions provided by the system (4.18). 
For the above choice of J we need to find K such that 



J J' + KK 1 = L 



which reduces to 



KK 1 



z\J\ — 6 
1-c 2 
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There are several possible choices here, one of them being 



r o c o 



k= o yr^? o 



_ v/l-c 2 _ 
Going back to initial problem, we obtain the solution 

J = O x .yJO' x y and K = O x , Y KO'^ 



The only possible problem with this choice of the matrices J and K is that when the particles X and Y get close 
or antipodal (X = ±Y), the above matrices are undefined because Ox,y does not. However, this does not happen, 
since by hypothesis x ^ ±y, and with the above choices of J and K the Brownian motions X t and Y t are at a 
fixed-distance (the initial distance). 

Finally, since (X t , Y t ) solves a stochastic differential equation and the matrices Jt and K t are actually functions 
of (X t , Y t ), this means that the process (X t , Y t ) is in fact a diffusion on S 2 x S 2 . This is in fact stronger than mere 
Markovianity. □ 

Remark 6. It is tempting to extend this argument to higher dimensional spheres. If we follow the same argument we do not 
have to change anything up to (4.16). The attempt on solving (4.16) was based on arranging the vectors X, Y in a certain 
position, in other words, make X for instance to be ei and Y a linear combination of e x and e 2 . Since there is essentially (up 
to a sign choice) a unique perpendicular unit vector to both X and Y, the condition that Ox,y sends this into e 3 determines 
the matrix Ox.y perfectly well. In higher dimensions this becomes an issue because there is no canonical choice of the matrix 
Ox.y- Indeed, given two vectors X, Y it is not clear that one can produce a number of vectors which depend smoothly on 
X, Y and be a basis of the orthogonal complement of the span of X, Y. In more abstract terms, ifVk^n is the Stiefel manifold 
of k orthogonal frames (k > 2) in W 1 , then our problem becomes equivalent to the problem of finding a cross section of the 
projection Vk. n — > V 2 . n - The projection used here is sending the frame fa, fa, ■ . ■ , fk into fa , fa. It is know that this is possible 
(cf. [8, Theorem 1.7]) if and only ifn — 3 and k = 3, and this shows that the proof above works essentially only for the 
2-dimensional sphere. 

For the higher dimensional spheres, we are going to use a different approach. So far, we have only used the extrinsic 
approach which is very versatile in the present context, but could become a weakness when one wants to extend it to other 
manifolds. 



From the geometric point of view, the constructions in the previous sections are extrinsic, which means that 
the manifold in discussion (S 2 ) is imbedded into another manifold (M 3 ). The intrinsic point of view is to study 
the manifold with reference to its own Riemannian structure and not by embedding it into another one. Since the 
Brownian motion on a manifold is essentially an intrinsic object, it is natural to try to find couplings which are 
defined in terms of the intrinsic structure. This is what we want to achieve in this section. 

We start with a e?-dimensional Riemannian manifold M and we will use the notations introduced in Section 4. 
On M, one can construct the Brownian motion as the solution to a martingale problem associated to the Laplacian. 
For more details, we refer the reader to [7] and [14]. 

What we intend to construct is a coupling following the line of ideas from [7, Section 6.5] where the mirror 
coupling is discussed. Keeping the notation from [7], we want to define the coupling as a solution to a certain 
stochastic differential system at the level of orthonormal frame bundle. If Uo is a given orthonormal frame bundle 
at xq and Vq = O X0:V0 Uo is an orthonormal frame bundle at yo, the system we consider is 



where IiS is an isometry from T X M into T y M to be chosen later. Notice that in order to ensure that B t is also a 
Brownian motion on K™, we need to make sure that the map XyV is actually an isometry. This in turn guarantees 
that X t and Y t are Brownian motions on the base manifold M starting at xq and yo. In what follows we use the 
notation 



5. An Intrinsic Approach on Manifolds of Constant Curvature 



(5.1) 



< 



'dU t =J2tiHi(U t )odWi 
dV t =Y? l=1 H l {V t )odB\ 
dB t = V t - 1 Xt Y t U t dW t 
X t = nUt 



Y t = irVt 



(5.2) 
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which is the projection of the vector Hi(Ut) onto the base tangent space Tx f M whose common interpretation is as 
the parallel transport of £/ ei along the Brownian path X from T Xo M to Tx t M. 

The generator of the diffusion (X t ,Y t ) up to the first time t for which Ox t ,Y t becomes undefined is given by 

^x.y — ^x ~t~ ^y ~t~ ^(^,y-^ii 

where Ei is an orthonormal basis at x and Fi is an orthonormal basis at y. As one can check this is consistent 
because the definition of C x . y does not depend on the choices of the orthonormal basis Ei and Fj. Perhaps a little 
better suited to Ito's formula is the writing 

n 

£x,y — / ^X-Ej + O x .yEi) 2 
i=l 

with the understanding that the action of Ei on a function f(x, y) is with respect to the variable x, while XtV Ei 
acts on / as the derivative with respect to y. 

The first natural thing to look at in the context of couplings is the distance function between the processes. 
Thinking of d(x, y) as a function of two variables and using Ito's formula for the semimartingale decomposition of 

p t = d(X t ,Y t ), one obtains 

(5.3) d Pt = ((e<(i) + Ox t , Y Mt))p)(Xt,Y t )dBl + l -{Cp)(X u Y t )dt 

which is well defined up to the first time t when either X t , Y t get at each other's cut-locus or Ox t ,Y t ceases to be 
defined. As it is known, e.g. [7, Theorem 6.6.2], for the mirror coupling we have a better result, namely, 

d Pt = {(ti(t) + Ox t ,YMt))p)(Xt,Y t )dBi + ^{Cp){X u Y t )dt - dL t 

where the process L t is a positive non-decreasing process which increases only at the cut-locus. 

In our framework this should follow in a similar manner as the argument of the proof of [7, Theorem 6.6.2] for 
the mirror coupling. However, instead of appealing to an extension of this argument, we want to use a soft version 
of that argument for the construction of the coupling here. Assume that we have a smooth choice of O x , y for all 
x, y which are not at each other cut-locus. 

The key step in the definition of the process L t above is the following. Take a small e > and let C e be the e- 
neighborhood of the cut-locus Cut. Then, if we start the coupling from two points which are within the injectivity 
radius and run it until it hits C € , and then we continue the process with two independent copies until the joint 
process exits C^- At this point we resume the coupling given above until it hits again C e , at which point we 
switch again to two independent Brownian motions until the joint process leaves C2 £ , and so on. The main point 
in the proof of the existence of L t is to show that one can let e go to zero and that the time spent in C2 e goes to 0. 
We do not need this refined result here, we just need a process which continues to have the marginals as Brownian 
motions, and outside of a small neighborhood of the cut-locus runs according to (5.1). 

We will refer to this construction here as the extension beyond the cut-locus and we will use it from now on as soon 
as the rotation matrix O x . y is defined up to a complement of C e for some e > 0. Notice that with this construction 
we do not need to define O x%y for (x, y) € C c . Also, it can be shown that this construction is well defined for 
all times t > if XtV depends smoothly on x, y outside C t . For instance, the Brownian motion spends a certain 
amount of time inside C e due to the curvature condition, and also the coupling spends a certain amount of time 
outside C e , and such it is not hard to check that the coupling does not explode in finite time. This argument is 
relatively standard and we do not insist on it. 

Going back to the equation (5.3), we notice that the first variation formula provides us with a way of computing 
the martingale part via 

(5.4) (Mt) + Ox t ,YMt))p)(XuY t ) = (0 Xtl YMt)MPt)) - (*(*), 7t(0)>, 

where jt is the minimizing geodesic curve joining X t to Y t and running at unit speed. If we want the coupling to 
have a deterministic distance, we need the martingale part to be and thus we need to enforce that 

(5.5) (Ox tlY Mt)MPt)) ~ (ti(t),U0)) = o. 

Since Zi(t) is an orthonormal basis at Tx t M, this means that the orthogonal transformation Ox t ,Y t preserves the 
tangential and vertical spaces in the geodesic direction along -f t ■ 

The identification of the bounded variation part of (5.4) is also pretty standard and follows the same proof as 
the one in [7]. The result is that 

d 

(5.6) {C P )(X tl Y t )=Y J A-h,'h), 

i=l 
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where X is the index form defined in (2.4) and Ji is the Jacobi field along the geodesic 74 with the boundary 
conditions ti(t) at 7(0) and Ox t ,Y t ti{t) at j(pt)- 

In fact, the sum in (5.6) is based on the initial values of the Jacobi fields J, which are given in terms of 6j. It 
turns out that if we pick any orthonormal basis at Tx t M, say Ei, and let Ji be the Jacobi field Ji determined by 
the condition that at the endpoints equals respectively Ox t Y t Ei, then the sum in (5.6) does not change. This 
observation is useful for later computations. 

Among the candidates to the role of the isometry O x<y one is the parallel transport along the minimizing geo- 
desic from x to y, and the resulting coupling is called synchronous coupling. 

The other choice which fits into the picture is the one when O x . y preserves the tangential component of the 
geodesic from x to y, but changes the sign of the vertical component after performing the parallel transport. Ge- 
ometrically this is a version of perverse coupling and we will refer it so. With this choice, perpendicular to the 
geodesic the particles move in opposite directions. 

More precisely, let r x ,y be the parallel transport of T X M into T V M along the minimizing geodesic 7 and let 
T X M = R7(0) + T^y be the orthogonal decomposition of T X M into the geodesic direction and the perpendicular 
direction. Similarly let T y = M/f(p) + T±, with p = d(x, y). The two choices described above are given by 

(5-7) O Xi yj(0) = j(p) and XiV ^ = t X:V £_, 

respectively by 

(5-8) 0^7(0) = j(p) and O x ^ = -t X)3/ £, 

for any £ e T xy . 

The first result of this section summarizes the main properties of the coupling in the case of constant curvature 
manifolds, as follows. 

Theorem 7. Let Mbea d-dimensional Riemannian manifold of constant sectional curvature r. For simplicity consider only 
the cases r = — 1, or 1, the general case following by a scaling argument. 

If the starting points x , yo are chosen such that p a < i(M)/2, then the following hold. 

a) For the choice of O x ^ v as in (5.7), the coupling of the Brownian motions satisfies the property that 

!ifr = -1, p t > poforallt > 
z/r = 0, p t = poforallt >0 
ifr = 1, < p t < Ce - ^ 1 )*/ 2 for all t > and some constant C > 0. 

b) For the choice of O x , y as in (5.8), in all cases, 

(5.10) p t > Poforallt >0. 

Moreover, in the case of the model spaces, namely the hyperbolic space (r = —1), the sphere (r = 1), and the plane (r = 0), 
for any starting points xq 7^ yo which are not at each other's cut-locus, the following hold true. 

c) For the choice of O x ^ y as in (5.7), 

!ifr = -1, p t = 2arcsinh(e( d - 1 )/ 2 sinh(p Q /2)) for allt>0 
ifr = 0, p t = p forallt>0 
ifr = l, p t = 2 arcsin(e-(' i - 1 )*/ 2 sm(p /2))for all t > 0. 

d) For the choice of O x ^ y as in (5.8), 

{ifr = -1, p t = 2ai'ccosh(e (d ~ 1)t/2 cosh(p /2))/or allt>0 
ifr = 0, p t = V 'p'o + 4p - l)i for all t > 
ifr = 1, p t = 2 arccos(e~ (d ~ 1)t/2 cos(p /2)) for all t > 0. 

Remark 8. In the case of the sphere S 2 , the construction in the above theorem matches the one in Theorem 4, but also covers 
the case of spheres in all dimensions, and has the virtue of being intrinsic. 

Proof. We need to compute the expression from (5.6), which is ultimately reduced to the computation of the index 
forms. Fix two points i/j/ which are not at each other cut-locus. 

Take a choice of an orthonormal basis Ei, i = 1 . . . , d at x with E\ = 7(0) and use the parallel translation along 
the geodesic joining x to y to extend it to each point of the geodesic. Then from (2.6) and the formulae following 
this in Section 4, plus some straightforward computations, reveal that the equation for the distance becomes 

(5.13) dp t = -(d - l)^ta,n{sjrp t /2)dt, 
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which is valid for times t up to the time when (X t , Y t ) hits the set C e , for e > chosen sufficiently small. 

In the case r = 0, this implies that p t is constant for all times. In the case of r = — 1, we have that up to the time 
when (X u Y t ) hits C e , 

dpt = (d - 1) tanh(/9t/2)dt. 

In the case of the hyperbolic space there is no cut locus, so this equation solves as in (5.11). 

In the case of other manifolds of curvature —1, we may still have to deal with the fact that the cut-locus is 
non-trivial, and we reason as follows. First, equation (5.13) shows that p t increases until the time when (X t ,Y t ) 
enters C £ (e small enough). Then the motion moves independently until (Xt, Y t ) leaves Ci t , and all this time the 
distance is certainly greater than half the injectivity radius, and thus is larger than po. Then the processes resume 
the coupling and the equation above shows that the distance increases again. Overall, in both cases the distance 
stays greater than p . 

In the curvature r = 1 regime, the equation (5.13) becomes 

(5.14) dp t = -(d-l) tan(p t /2)dt. 

The solution is given by (5.11), at least up to the time when the processes get very close to each other's cut-locus. 
In the sphere case it is clear that X t and Y t never get at each other's cut locus, so we do not have to worry about 
the cut-locus. In other cases of constant curvature r = 1, by Mayer's theorem the diameter of M is no greater than 
7T, so the tangent in the above formula is always non-negative, and thus the distance is decreasing and so we do 
not have to worry about getting close to the cut-locus. Then the equality dp t = —(d — 1) t&n(p t /2)dt implies that 
Pt < Ce'^' 1 ^ 2 and also that p t + 0. 

Now we move one to the second choice of IiS from (5.8). The changes are that the index form is computed 
(again from (2.6)) as 

AJ,M = (y-t(vW2), r/o 



. Pt ■ 



r = 



which give in turn the equation for p t as 



dpt 



(d - l)Vrcot(Vr/>t/2)dt, r + 
&=£dt, r = 0, 



up to the time when (X t , Y t ) get close to the cut-locus. 
When r = 0, in the particular Euclidean case we get 



Pt = JpJ+Md^Tjt. 



In the case when there is a cut locus and the points X t , Y t get at each other's cut locus, we can argue as before that 
the distance pt increases, and then eventually we have to switch to the independent Brownian motions inside C^e, 
and then resume the coupling. At any rate, as soon as the distance becomes less than i(M), p t starts increasing 
again, so it will never be less than the initial value po- 
In the case of negative curvature r = — 1, we obtain 

dp t = (d-l) coth(p t /2)dt, 

and in the hyperbolic case the solution is given in (5.12). If the cut locus is not empty, then we argue as we already 
did for the flat case. 

Finally, for r = 1 the equation is 

dp t = (d - 1) cot(pt/2)dt, 

and the solution is given by (5.12) in the case of the sphere, and in the general case we can argue as in the flat 
case. □ 

6. Shy and Fixed-Distance Couplings on Riemannian Manifolds 

In this section we prove a general result about the existence of shy coupling on Riemannian manifolds. Before 
we launch into various technical details, we state the main result of this section. 



Theorem 9. Let M be a d-dimensional Riemannian manifold such that for some real number k we have 
(6.1) Ric x > k for all x £ M and sup K x < oo, 

where Ric is the Ricci tensor and K x stands for the maximum of the sectional curvatures at x e M. 



16 



MIHAI N. PASCU AND IONEL POPESCU 



a) There exists e > such that for any points xq, yo £ M with d(xo, yo) < ewe can find a Markovian coupling of 
Brownian motions X t , Y t starting at xq, yo such that d(X t , Y t ) > d(xo, yo)fo r £ > 0. 

b) If in addition k > 0, then there exists e > such that for any xo,yo G M with d(xo,yo) < e, there exists a 
Markovian coupling of Brownian motions X t ,Y t starting at xo,yo with d(X t ,Y t ) — d(xo, yo)for all t > 0. 

c) Moreover, ifk>0, then there exists e > such that for any x ,y € M with d(x , y ) < e, there exists a Markovian 
coupling of Brownian motions X u Y t starting at x , y with d(X t ,Y t ) = d(x , yo)e~ kt for all £ > 0. 

The plan of the proof is as follows. First we set up an extension of the orthonormal frame bundle (which will be 
used in the case of even dimensional manifolds). Then we define the equation of the coupling at the level of this 
frame bundle and we seek a local solution. Once we show the local existence of the coupling, we use patching in 
order to prove the global existence of the coupling. 

We split the proof into several subsections. 

6.1. TV -frames and the associated bundle. One of the constructions of the Brownian motion on a e?-dimensional 
Riemannian manifold uses the notion of orthonormal frame bundle. We first extend this notion by introducing the 
following. 

Definition 10. Let N > dbe an integer number. An N-frame U in T X M is a map U : R N — > T X M such that UU' = Id. 
Alternatively, U is an N-frame at T X M if the map U' is an isometric imbedding ofT x M into R N . 

Using an abuse of language we often say that U is an TV-frame at x rather than in T X M. 

Another way of describing U is via the vectors Xj = Uei, i = 1 . . . N, where e, are the standard basis vectors in 
M. N . The condition that U is an TV-frame is actually equivalent to the condition that 

N 

(6.2) X i) X i = € fOT a11 £ G T - AL 

i=l 

Indeed, if X t = Uti, then 

N N 

Xi)Xi = UJ2(U'Z, e t )e t = UU^ = £ for all £ € T X M. 

i=l i—1 

Conversely, the condition (6.2) determines an TV-frame U : R N — > T X M by prescribing 

N 

Urj = ^2{q,ei)Xi, 

i=l 

noting that 

N 
i=l 

which under (6.2) gives UU' = Id, as needed. 

Hence we have different characterizations of an TV-frame, as a projection, as an isometric embedding and as a 
set of vectors [7ej. 

Suppose we have two points x,y 6 M, an TV-frame {Xi}f =1 at x, and an isometry A : T X M — > T y M. Then 
{AXi}f =1 is certainly an TV-frame at y because 

N N 

^,AXi)AXi = A^2(A'£,Xi)Xi = AA'Z = 

i=i i=i 

Also, it is easy to see that if O is an orthogonal transformation of R w and U is an TV-frame, then UO is also an 
TV -frame. This in fact defines the action of O(N) on the fiber 0{M) X , which is the set of all TV-frames at x. Now we 
can construct the bundle of TV-frames denoted by O n (M), and for simplicity we will drop the superscript TV. It is 
clear that 0(M) is a smooth bundle over M and ir : O(M) —> M which assigns to each TV-frame U in T X M its base 
point x (i.e. ttU — x) is a smooth map. 

In the terminology of differential geometry, 0(M) is actually a fiber bundle with the fiber being the Stiefel 
manifold Vd.N constructed from the trivial principal bundle M x 0{N) over M. 

For each fixed U at T X M, the tangent space TjjO(M) splits into the horizontal part TffO{M) obtained by lifting 
tangent vectors from T X M and the vertical part T^fO(M) which contains a special class of tangent vectors obtained 
by differentiating curves which are determined by the action of O(N) in the fiber. For references the reader can 
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consult [7] or [14] (the discussion there is intended for the orthonormal frame bundle, but nevertheless most of it 
extends naturally to this context). 

Now, we define the fundamental vector fields Hi on O(M) by the prescription that at each U, (Hi)u is the lift of 
the vector Uei from T^jjM. The main property here is that the associated Bochner Laplacian 

N 
i=l 

projects down onto M as the Laplace operator. The proof is as in [14, Section 8.1.3], and for simplicity we just point 
out the main difference. For a vector £ e M. N , let (H^)u be the horizontal lift of U£ at U. Then with the same proof 
as [14, Equation 8.30], for any smooth function / on M we have 

(Hs)u o H v (f o tt) = ((Hess/UE^, Ur,), 

where Hess/ is the Hessian of / on M. Once this is established, we can continue with 

N N N 

J2(Hi)uHi{f o tt) = ^((Hcss/^f/e,, Ue,) = ^([/'(Hess/)^^, a) 

i—1 2=1 2 — 1 

= ti(U' {Kess f^uU) = ti{(Ressf) wU UU') = tr((Hass/) ltr ) 
= (A M f)(nU), 

where we used that the Laplacian on M is simply the trace of the Hessian. Thus 

(6.3) tt.Ab = A M . 

Under the assumptions in (6.1), the Ricci curvature is bounded from below from which we learn that the Brow- 
nian motion on M does not explode. Thus the Brownian motion constructed on O(M) (more appropriately the 
solution to the martingale problem for As) projects down into the Brownian motion on M and exists for all times. 

6.2. The Coupling SDE. Now we want to couple Brownian motions on M, and for this matter we consider cou- 
plings of a similar form to the one in the previous section. More precisely, for given points Xo,yo € M and 
N -frames Uq at xq and Vq at yo, consider the system 



(6.4) 



dU t =J2f =1 H l (U t )odWi 
dV t =J2? =1 H t (V t )odBl 



dB t = Ut ,v t dW t 
X t = nU t 
Y t =vV t . 

Here Wt is an TV-dimensional Brownian motion and Ou.v is an orthogonal N x N matrix which depends smoothly 
on U, V, at least on a subset of 0{M) x 0{M) which will be specified later on. This insures that B t is also an 
TV-dimensional Brownian motion. We do not impose additional conditions on the matrix Ojj t ,v t yet. 
The same arguments as in [7, Section 6.5] show that the generator of the diffusion (Ut, Vt) is given by 

N 

A c = Ab.j + A B ,2 + 2 H e*,2Hi, 



i=l 



where the subscript 1 or 2 represents the action with respect to the first or the second variable, and e* = Ouye^. 

Let p t = d(X t , Y t ) be the distance function between the processes X t and Y t and let p(t) = d(nUt,TrVt). Also let 
d(U, V) = d{irU, ttV) be the lift of the distance function from M into O(M). Using Ito's formula we have that 

(6.5) dp t = (iH itl + H e * i2 )d) (U t ,Vt)dW t + \ (A c d) (U t , V t )dt, 

which is certainly valid in the region where irllt and %Vt are not at each other's cut-locus. 
The first variation formula gives 

(H itl + H e *. 2 )d(U,V) = (f/e i ,7x,y) 7r y ™ (VO u ,v^h'Yx,y)wU, 

where X = nil, Y = irV, and 7x,y is the minimizing geodesic joining X to Y, run at unit speed. The bounded 
variation part comes from the second variation formula and produces 

N 

(6.6) (A c d){U,V) = Y J A^Jr), 
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where Ji is the Jacobi field along the geodesic joining ttII to ttV, with values Uei, VOu.vei at the endpoints. 
In order to cancel the martingale part from (6.5), we need to impose the condition 

(Uei,jx,Y)^v ~ (VOuye^jx^TrU = 0. 

6.3. Local Construction. This part of the proof consists in showing that there exists r\ > sufficiently small such 
that for any x, y € M with d(x, y) < 77 there is a smooth choice of Ouy or\J\f ri (x,y) = 7T~ 1 (B(x, 17)) x n~ 1 (B(y,r])) 
for which 

(6.7) (Ue i , A f n u ! i r v)nU - (VO U ye l , A f 7rUiJrV )^v = for (U,V) &J\f n {x,y) 
and 

N 

(6.8) ^Z{Ji, Ji) >0, for([/,F) eAf v (x,y), 

i=l 

where are the Jacobi fields with boundary values Uei and V^Oy.ve,; at the endpoints of the geodesic joining nU 
and irV. Note here that for small r\, there is a unique geodesic joining ttU and 71- V, so everything is well defined in 
this case. 

Take 77 < i(M)/3, where i(M) is the injectivity radius of M. In fact we are going to choose possibly smaller 
values of 77 later in the construction, but for now assume that it is smaller than i(M )/3. The fact that the injectivity 
radius is positive follows from the hypothesis that the sectional curvature is bounded from above (see for instance 
Proposition 2.4 from Chapter 10 of [4] and the discussion following Lemma 11 below). 

Now, assume that xo,yo € M are two fixed starting points with distance d(xo, yo) < r\. We will construct the 
coupling (U t , V t ) mJ\fr,(xo,yo). 

We can choose an orthonormal basis E\, E2, . ■ . , Ed at x such that E\ = 7x^(0) and such that each Ej depends 
smoothly on (x, y) 6 B(x , ~q) x B(y , ?]). We can extend this basis E\, . . . ,Ed along j XiV and continue to call it 
Ex,..., Ed- Now, condition (6.7) becomes 

(6.9) U% tV - 0' UiV V'j Xiy . 

Next, let us denote Jij the Jacobi field along the minimizing geodesic joining ttII to ttV such that it equals Ej at 
irU and at irV . Similarly let J2 j be the Jacobi field which is at irU and Ej at ttV . Then, since 

d d 

Ji = ^{Ue i ,Ej)Ji t j+ y £ i {VO uy e i ,Ej)J % j 
j=i j=i 

it follows that 

N d d d 

(6.10) Y. 1 ^ Jl ) = Y I K-l. ■ Y^l(-l'.--h.r + 2 (°u,vV'Ej, U'E k )l(Ji d , J 2 . k ). 

i=l j=2 3 =2 3,k=2 

The expression given by the last sum can be simplified as follows. Let T x>y stand for the parallel transport map 
from T X M to T y M along the minimizing geodesic 7 X y . Consider the bilinear map A x y : T X M x T X M — >• R defined 
by 

where Ji^ is the Jacobi field along j x>y which is ^ at x and at y, and Ji,^ is at x and T x , y r\ at y. Another way of 
looking at this is as a linear map from T X M into itself, map which we still call A X)3/ . We can see this map also as a 
linear transformation preserving the orthogonal to 7^ at x and we will denote this restriction also by h x , y . In fact 
the actions of K XtV and its transpose on j x _ y are zero. 

With this notation, it is not hard to see that for A-frames U and V at x, respectively at y, we have 

d 

(6.11) Yl (°'u,vV'Ej, U'E k )l(Ju,J 2ik ) = ti(UO' UtV V'T XtV A x , v ). 

3,k=2 

For the first part of the theorem we want to find a map Ojj.v such that the quantity in (6.10) is positive. In order 
to do this we first show that for each j = 2, . . . , d we have 

(6.12) l(J ltj ,J ld )>0 and 1{ J 2J , J 2 ,j) > 0, 
and consequently 

d d 

(6.13) E Z ( J ^> J i^) + E Z ( J ^' J ^ > °- 
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To carry this task through, we are going to use the following standard comparison result, whose proof can be 
found in [4, pp. 216-217]. 

Lemma 11. Assume that M and M are two manifolds and 7, 7 are two normalized geodesies defined on [0, p] such that 
7 does not have conjugate points. Assume that J t and J t are two Jacobi vector fields along 7, respectively 7, such that 
Jo = Jo = 0, \ J P \ = \ J p \,and 

K+Mt))<K-tf(t)), 

where K + (x) is the maximum of the sectional curvature at x and K~(x) is the minimum of the sectional curvature at x. 
Then we have 

(6.14) 2(J,J)<2{J,J). 

Since the sectional curvature is bounded from above, K x < l/r 2 for all x G M for a small enough r, and for 
reasons which will become apparent immediately we take r < (1/4) 1 / 3 . In turn, this shows that the injectivity 
radius is at least m (see [4, p. 218]). 

With this, for points x, y e M at distance p = d(x, y) < irr, comparing the index form of the manifold M with 
the index form of a sphere of radius r, for geodesies of length p < ixr, we obtain 

1(JJ) <X(J,J), 

where J, J are as in the Lemma 11. On the other hand, for the d-dimensional sphere S d we have J(s) = W2{s)E(s), 
where w 2 is given by (2.6) and E is the parallel transport of E Q e T^ / ^ - ) S d along 7. From (2.5) we conclude that 

1(J, J) = Wzifi) = y/r cot(\/r-p) 
and consequently, because we choose r < (1/4) 1 / 3 , we obtain 

(6.15) < y/r cot( v / r i o) = X(J, J) < 1(J, J) 
which is the claim of (6.12). 

We now choose 77 sufficiently small, for instance smaller than r above, and therefore also less than a third of the 
injectivity radius of M. 

For the first part of the theorem it suffices to choose Ouy such that 

U'%,y = 0' UV V'j x ,y 

and 

tT{UO' u>v V'T x , y K, y )>-- [^I(J lj ,J lj ) + ^I(J 2j ,J 2j )] . 

y=2 j=2 / 



To do this, first we choose a matrix $> XtV which is close to A x>y in the Hilbert-Schmidt norm, more precisely 

\\^x:y - ®x,y\\tJS 



\J=2 j=2 J 



and such that tr ($ x ,y) 7^ 0. It is not hard to check that it is always possible to choose such a matrix Q x . y . Now in 
the basis E± = j x , y ,E2, ■ ■ ■ , E d we can take 

fj = U'Ej and hj = V'E h j = 1, . . . , d. 

We treat separately the cases of odd and even dimensional manifolds, as follows. 

Case I: d is odd. In this case we take N = d, so we are back to the classical situation of the orthonormal frame 
bundle. Let Ajj and Ay be the (unique) orthogonal matrices which send ej into fj, respectively ej into hj, j = 
1, . . . , d. We will choose the matrix Ojj,v such that 

A l u O' uy A v e 1 = ei and ti{A! v C/ U)V A v Q XM ) = 0, 

where 3> X)3/ is the matrix acting on the last d — 1 coordinates of R d and having the same matrix representation as 
the matrix A XAJ in the E2, ■ ■ ■ Ed basis, in other words ($> XtV Ej, Ek) = (A XiV ej, e/-). We pick an orthogonal matrix 
B x , y such that 

B x , y ei = ei and tr(B x ^ y A Xyy ) = 0, 

and with this we take 

Ouy = AyB'xyA'jj. 
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To get to terms with the matrix B XtV , we choose it to be given in matrix form by 



(6.16) 



B 



1 


























cos a 


sin a 




















— sin a 


cos a 


























cos a 


sin a 




















— sin a 


cos a 































cos a 


sin a 




















— sin a 


cos a 



This is where we actually use the fact that the dimension d is odd: in the above representation we use on the 
diagonal (d — l)/2 blocks of 2 x 2 unitary matrices. With this choice, we clearly have B x>y ei — e\, and it remains 
to pick a. 6 [0, 27r] such that tr(B X}V A Xiy ) = 0, or equivalent 



cos(a)tr(A 2;!a ) + sin(a)F Xjy = 0, 



where F x , y = 2\^i=2 (Ax,v e i) e *+i) — Y?n=z(^-x,y e i> e i-i)- Because tr (A^) ^ 0, the above equation has a unique 
solution in [0, 27r) which depends smoothly on x, y. 

Case II: d is even. In this case we use N = d+l. The difference from the previous case is that this time we consider 
the matrix An which sends e,j into fj,j = 1, . . . , d, and the vector ed +1 into the exterior product of /i, . . . , /<j, and 
we define the matrix Ay in a similar fashion. Note that with this choice, Ajj and Ay are orthogonal matrices. 
The rest of the argument is now the same argument as in the case when d is odd, with the choice of Buy as a 
(d + 1) x (d + 1) matrix such as the one in (6.16). 

Let's wrap up the main findings of this subsection. We showed that there exists a matrix Ou.v which depends 
smoothly on (U, V) € Af v (x 0l yo) such that (6.7) and (6.8) are satisfied. In fact we proved that for small enough r\ > 
0, as long as the distance between xq and yo is less than ?//2 and the process (X t , Y t ) stays inside B(xq, rj) x B(y , rj), 
the distance function satisfies 



(6.17) 



dp t > a cot(&p t ), 



for some positive a, b > 0, which depend only on r or for that matter only on the upper bound on the sectional 
curvature. 

In the case of non-negative Ricci curvature, Ric > k > 0, we can continue with a choice of Ojj.v which satisfies 
a similar property to (6.7) and (6.8) as follows 

(6.18) {Uei,%u^ v )7vU - {VOu,vei,%u,-wv)-KV = for (U,V) eN v (x,y) 

and 

JV 

(6.19) 



Y, X Vi, Ji) = ~kd( Xl y), for (U, V) G M v {x, y). 



To show this, we first recall the following standard result in Riemannian geometry. 

Lemma 12. Assume 7 is a normalized geodesic on [0, p] without conjugate points on it. If J and V are two vector fields with 
the same boundary values, and J is also a jacobi field, then 



(6.20) 



1(J,J)<1(V,V). 



This result can be easily deduced from [4, Lemma 2.2, Chapter 10] as follows. First note that both J and V 
cancel at one end of the geodesic. However, if J and V have the same values at the endpoints, then due to the 
equation of the Jacobi field and an integration by parts, we have 

1(J, V) = (j(p),V(p)) - (J(0), V(0)) = (J(p), J(p)) - ( j(0), J(0)) = X(J, J), 
so using the index lemma for the vector field V — J and the trivial Jacobi field 0, we obtain 

< 2(V -J,V-J)= 2(V, V) - 2X(V, J) + X(J, J) = 1(V, V) - X(J, J). 

Now, 
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On the other hand, using the above comparison theorem with the vectors Ej in place of V and Jxj + J r 2 .j as the 
Jacobi field J, we obtain 

d d d p 

X^Jij + J 2>j , Jij + J2j) < Y^IiE^Ei) = J2 (\Ej(s)\ 2 - (ii(7(«), 7(*)>) ds 

j=2 5=2 3=2 J ° 

•P 

Ric lis) (j(s) 1 A f(s))ds < -kp 



where p = d(x, y), and therefore 

d d d 

(6.21) 2^X(J 1J , J tJ ) < -C£l(Ji,3, Ji,j) + E X ( J 2^ J 2^)) - fe P < 0. 

3=2 j=2 j=2 

What we want to show is that we can choose Ouy such that 

N d d d 

Y,l{Ji, .h) = Y,l(Ji,j, Ji.j) + ^X(J 2J , J 2J ) + 2 {UO' u<v V'E h E k )X{J ltj ,J %k ) = 0. 

i=X 3=2 3=2 3,k=2 

To do this, we try to find Ojjy as in the previous case, namely Oyy = AyB' x yA'y, with a matrix B xy as in (6.16). In 
turn, this requires to choose a such that with the choice of $u,v = ^u,v and A XiV the matrices in the corresponding 
basis, 



2 cos(a)tr (A XtV ) + 2 sm(a)F x , 



■ d d \ 

^2i(j ld , Ji (i ) + ^x(j 2j , j 2J ) j - fep, 

, j=2 j=2 / 



where F x<y = Y^i=2 (A x , y ei, e i+ i) — J2i=3(^x, y ei, e,_i). The key is that (6.21) is nothing but the statement that 

(d d \ 

^l(J hj ,Jl tj ) ■ )J /(./,,.,../, ,: - fcp < 0, 
3=2 3=2 / 

and then simple trigonometry shows that for any a < c < and any b, the equation 

cos(a)a + sin(a)6 = c 

has the solution 

a = arccos 



Vo 2 + 6 2 / VvV + fo 2 , 

Summing up the findings, we have shown that we can choose B x ^ y which depends smoothly on x, y, and thus 
Ou.v depends smoothly on U, V so that (6.18) and (6.19) are satisfied. 

6.4. The construction of the coupling. Consider first two independent iV-dimensional Brownian motions W t and 
W t . For a given stopping time r, we denote Wt, T = Wj — W T . 

We have proved that for a small enough -q > and any x, y with d(x, y) < r\ there exists a smooth choice Ojj,v on 
AT n (x, y). We will now use this to give a construction of the coupling as indicated in the statement of the theorem. 

For any 77 > we define the r/-neighborhood of the diagonal in M x M by 

D v = {{x,y) : d(x,y) < i]}, 

and we also set 

T> n = {(U, V) e 0(M) x O(M) : (ttU, ttV) e D v }. 

For a fixed pair of points (xo, yo) € D n u and frames Uq, Vo at xq, respectively at yo, we consider an orthonormal 
basis Ei, ... Ed at xq with E\ — 7a; o ,j, o (0) and extend this to a local orthonormal basis on B(xo, 2-q) and then by 
parallel transport also to B(yo, 2rj). Using the local recipe outlined above we can construct a coupling up to the 
first time t when the base process (X t , Y t ) hits the boundary of the set B(x , 77) x B(y a ,r)). Let's call this exit time 
t\. At {x\,yi) = (X Tl ,Y ri ) we have the orthogonal basis Ex,..., Ed used in the local construction, which at xx 
satisfies Ex = ix x ,y x i and Ux '■= U Tl and Vx := V Tl are the frames obtained from (6.4). 

The next step is to extend the construction of the coupling beyond the time Tx. There are two cases to be 
considered here. 

If the point (xx, yx) lies inside D^/2, we can use the starting point (xx, yx) and continue to run (Ut,Vt) following 
(6.4) using now the Brownian motion Wt, T i with the time range t > T\. As above we let r 2 be the first time the 
process (X t+Tl ,Y t+Tl ) hits the boundary of B(xx,rj) x B{yx,rj), and we set (x 2 ,y2) — {X Tl+T21 Y Tl+T2 ) and also 
U 2 = U T1+T2 and V 2 = V T1+T2 . 
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On the other hand, if the point (acj, yi) lands outside D ri / 2 , then we run the motions Ut and V* for t > t\ with 
the system 

[dU t = Yt 1 H i {U t )odWl Tl 
Uv t =Y,tiHi{V t )odWl Tl 

| X t = TTlJt 

{Y t =TrV t . 

In other words, U t ,V t run as independent Brownian motions on 0{M) x 0{M), and X t ,Y t run as independent 
Brownian motions on the base manifold M. We continue with this construction for time t in the interval [n , T1+T2], 
where the terminal time t\ + t 2 is the first time the process (X t ,Y t ) lands in D v u, and we denote (x2,y2) = 

{Xti +T-2 I _)_ T2 ). 

In both cases above we constructed the processes Ut, Vt defined up to the time n + t 2 , and (a?2, 1/2) is either 
in or outside it. Inductively, we can now repeat the construction above, to show that we can extend the 
construction of the processes for another T3 units of time, and so on. If for a certain n, r n = +00, then we certainly 
take all other stopping times r m = for m> n. 

One of the main problems is to show that the construction can be extended for all times t > 0, in other words 
that 

2J T„ = +OO. 

n>l 

We are going to do this separately for the first part of the theorem, and argue differently for the second and third 
part. 

In the first case, the idea is that as long as the process (X t , Y t ) stays inside D v n, from (6.17) we have that the 
distance process pt satisfies 

dpt > acot(bp t )dt, 

where a, b depend only on r. 

This means that if 77 is small enough and the starting distance is finite p , in finite (deterministic) time the process 
(X t ,Y t ) exits Once the process (X t , Y t ) exits the set D rj / 2 , X t and Y t run independently until they hit the set 
and then they stay in / 2 for at most a finite (deterministic) amount of time, after which they exit again 
D^/2- In particular we see that the processes X t , Y t have to run independently infinitely many times, and it is this 
fact that allows us to show that J2 n >i T " = +°°- This is done using the Borel-Cantelli lemma. 

For the moment assume that we have two independent Brownian motions X t , Y t starting at x$ , yo with d(xo , yo) = 
77/2. If t is the first time when X t ,Y t are within distance rj/4 of each other, we want to get an estimate on P(t > S) 
for some 5 > 0. To do this, we use the following inclusion 

{(x, v /i6 >6}n {( Y , v /i6 > 5} C {t > 6} 

where Cx,)j/i6 is the first exit time of X t from the ball B(xq, 77/16) and similarly Cy.jj/16 is the first time Y t exits the 
ball B(yo, 77/I6). This inclusion can be stated in words as follows. If X t and Y t stay inside B(xq, 77/I6), respectively 
B(yo, 77/I6), up to time 5, and since xq, yo are distance 77/2 apart, it follows that X t and Y t are not within 77/4 of 
each other in the time interval [0,6]. The conclusion we draw from this is that 

P(r > 6) > P(Cjc,„/ie > 5)P(Cr,„/i 6 > 6). 

Finally, since the the Ricci curvature is bounded below, we can invoke now the estimate on the exit times from 
balls, for instance [7, Theorem 3.6.1], to obtain that for any point x on M we have 

P*(C,/i« < S) < e- Cr2 / s , 

where the constant C > depends only on the lower bound on the Ricci curvature and the dimension of the 
manifold. Thus for a fixed rj > we obtain that 

(6.22) P a (G,A6 >S)>1- e^ 2/s := C 2 > 0, 

for a certain constant C > 0, and therefore 

P(r > 5) > C\. 

With this at hand we get that 

n^n >6) = +00, 

n>l 

and using Borel-Cantelli lemma we conclude that J2 n >i T « = +°°' which shows that the construction of the 
coupling extends for all times t > 0. 
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For the other case of fixed or decreasing distance coupling, the point is complementary to the previous one. 
More precisely, in the above proof it was the independent motions which played the main role, while here the 
main role is played by the coupling. To get to terms, note that if we start the coupling with points xq, yo such that 
d{xo, yo) < 77/4, then, since the distance between the processes does not increase, the process (X t , Y t ) stays in D v / 2 
up to the time Xm>i T ™- The issue is to show that this sum is always infinite. What we want to do is to find a lower 
bound on P(ri > 8). Using the same notation as above, we have 

(6.23) {Cx,n/iB >S}c {n > 8}. 

To see this, we follow the construction until either X or Y hit the ball of radius r\ centered at xq, respectively yo- 
Now, if X stays inside B(xq, tj/ 16) on the time interval [0, 8], since d(xo, yo) < 77/4 and the processes remain at fixed 
or non-increasing distance, an application of the triangle inequality shows that Y remains inside B(y , 98/16) on 
the time interval [0, 8], which in turn implies (6.23). Using again (6.22) we get that 

P(ti > 8) > C 3 > 

for a constant C3 which is independent of the starting points. Since this is applicable to all stopping times r„, we 
learn again from Borel-Cantelli lemma that J2 n >i r « = +°°- 

6.5. Finishing off. In the previous section we constructed the coupling and we proved that it is defined for all 
times. We want now to show that the construction actually does what the theorem asks for. This is already spelled 
out in the previous subsection in a certain form. 

For the first part, on each of the regions where the coupling is inside D v / 2 , due to (6.17), we see that the distance 
is non-decreasing, and therefore it is larger than the starting distance which is at most f]/A. On the other hand, if 
the coupling exits D v / 2 , then it runs as independent Brownian motions until it hits again D, n / 4 , and consequently 
the distance is at least rj /4 apart. In both regimes the distance does not get smaller than the starting distance and 
this concludes the proof of the first part of the theorem. 

For the second part, the process never leaves D, q / 2 and for all times it remains at a fixed distance. The last case, 
when the distance decays exponentially, being similar, we omit it. This concludes the proof of Theorem 9. 

7. Refinements and Comments 

The proof of Theorem 9 spreads on several pages, and some comments on it are in order. The first observation 
is that the hypothesis (6.1) in the statement of the theorem is essential for the construction. For once, the Ricci 
curvature bounded from below insures the non-explosion of the Brownian motion on one hand, and on the other 
hand it is important in the estimate of the exit times employed in the proof of the global existence of the coupling. 

That the scalar curvature is bounded from above does not seem to be optimal even though it is an important 
piece in the proof of the existence of the coupling via the index form comparison on M with the index form of 
a sphere. Geometrically, we certainly need to make sure that the Brownian motions we try to couple do not get 
trapped in regions of extremely high scalar curvature (small radius spheres) where the Brownian motions tend 
to get close to one another. It seems though that the optimal condition would be that the injectivity radius of the 
manifold is positive. However this certainly requires a different argument from the one provided here. 

Another aspect is that the global existence of the choice of the map Ojj,v is tight to the existence of a smooth 
choice of an orthonormal frame on M. On an arbitrary Riemannian manifold this can be done only locally and 
this is why we had to get one more step from the local existence of the coupling to its global existence. There are 
though a few cases when the existence can be proved globally, one of which being the case of surfaces. In this 
case, for any two points x, y not at each other cut-locus, there is a single perpendicular direction to the geodesic 
joining x and y. Using this we can show that there is a global choice of Ojj,v as long as irU, irV are not at each other 
cut-locus. 

Another case in which we can construct a global version of Ojj,v is the one in which M is parallelizable, namely 
the tangent bundle is trivializable, or otherwise put, there exist vector fields X\ , X 2 , . . . Xd which are independent 
at each point. This amounts to the existence of a global orthonormal frame bundle. It is for instance the case of S 3 
and S 7 and also of any Lie group with the left or right invariant metric. 

The couplings we constructed in Theorem 9 are defined for all times t > 0, and the conditions in (6.1) were 
necessary in the proof. There is however a case when the second condition can be dispensed of if one only needs 
the coupling to be defined up to the first exit time of the coupling from a relatively compact set. For completeness 
we record the result here and use it in the next section. The proof is basically the same as the one given above 
adjusted with a stopping time. 
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Theorem 13. Let Mbea d-dimensional Riemannian manifold and D c M a relatively compact open set ofM with a smooth 
boundary. Then, there exists e > such that for any x,y G D with d[x, y) < e, there is a shy coupling of two Brownian 
motions on M starting at x and y, defined up to the first exit time of either of the processes from D. 

If in addition Ric > 0, there also exists a fixed-distance coupling Brownian motions on M starting at x and y, defined up 
to the first exit time of either of the processes from D. 

We finish this section by pointing out that the suggestion given by Kendall in [10] for the construction of the 
shy coupling is to use the same orthogonal transformation as in (5.8) which is a form of perverse coupling (in the 
terminology of [10]). However, this is not sufficient to get the fixed distance coupling. Particularly this is very 
clearly illustrated in the case of surfaces. Indeed, since the dimension is 2, we have just one dimension left in the 
orthogonal to the geodesic joining X t and Y t and then there are essentially only two choices of an orthogonal map 
from T x to T y (for x, y not at each other cut-locus) which preserves the geodesic direction. These are the ones in 
(5.7) and (5.8). At least in the constant curvature case manifolds as we showed in Theorem 7 we do not get a fixed 
distance coupling. 



8. Applications 

As an application of the fixed-distance coupling we present a proof of the following maximum principle for the 
gradient of harmonic functions. 

Theorem 14. Let M be a Riemannian manifold with non-negative Ricci curvature and let u : M — » R be a harmonic 
function on M. Then, for any relatively compact open set D with smooth boundary we have 

(8.1) max |Vu(x)| = max |Vu(x)|. 

x£D x£dD 

Proof. Fix an arbitrary point x e D. Then there is a geodesic 7 such that 7(0) = x and 

v(x) := |Vm(x)| = lim wv ;; 

h— >o h 

In particular, for a small enough h > 0, we can consider the fixed distance coupling from Theorem 13 started at 
7(/i) and x, and run it up until the stopping time C, defined as the first time when either of the processes X t or Y t 
hit the boundary of D. On the other hand, since the function u is harmonic, u(X t ) and u(Y t ) are local martingales, 
and in fact, since u is bounded on D, we can write 

- u(x) = E[u(X ( ) - u(Y ( )]. 

The upshot of this equality is that there must be an lj in the probability space where the processes X t and Y t are 
defined, such that 

u(j(h)) - u{x) < u(X c (uj)) - u(Y c (u)) 
Since d(X^, Y^) = d(j(h), x) — h, we can find a point £ on the geodesic joining with Y^ such that 

- u(x) < u(X ( (u)) - u(Y c (uj)) < \Vu(£)\h. 

Since either X t or Y t are on the boundary of D, we conclude that £ is distance h or less from the boundary dD. 
Thus, as h goes to 0, from the compactness of dD, we can find a point a 6 dD such that 

v(x) = |Vm(x)| < |Vu(a)| < max 

xGdD 

which concludes the proof. □ 
Another application with a similar flavor is to the gradient of a solution to the heat equation. 

Theorem 15. Let M be a Riemannian manifold with non-negative Ricci curvature, D c M a relatively compact open set 
with smooth boundary, and u a classical smooth solution to the heat equation d t u = \ Au on an open relatively compact set 
fl containing the closure of D. For x e D, let fj, x ^r> be the distribution of the hitting time £ of dD for a Brownian motion 
started at x. Then, for each iefl and < s <t 

(8.2) \\7u t (x)\ < / \\7ut-cr\dDHx,D(d<j) + fix,D((s,oo))\Vu t - s \D, 

Jo 

where for a bounded set K and a continuous function w on K, we denoted \w\k = sup^g^- |iu(x)|. 
Note that in the case when u is harmonic, (8.2) implies (8.1). 
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Proof. As in the previous proof, consider a point x and let 7 be a geodesic such that 7(0) = x and 

IV7 I \ I V MlW) ~ U t{ X ) 

\Vu t (x)\ = lim : 

h.->o h 

Using the fixed distance coupling started at j(h) and x given by Theorem 9, and with £ the first time either X t 
or Yf hit 3D, and combinining with the fact that u t - s {X s ) and u t - s (Y s ) are local martingales, we get for h > 0, 

u t{l{h)) - u t (x) = E[u t -sAc(X sA( ) - Ut- S At( Y sAc)] 

= E[u t _ c (X f ) - u t - C (Y c ), (<s}+ E[ut- s {X s ) - ut_,(y 5 ), s < (} 

< hE{\Vu t _ c \ {dD)h ,( < s] + hE[\Vu t _ s \ D ,s< C] 

where denotes the set of points of M at distance at most h from K. Dividing by h and letting h go to 0, the first 
part follows. 

For the second part, if u is a harmonic function and the maximum of u on D is attained at a point x E D, then 
taking this point in (8.2) gives (8.1) □ 

8.1. The Brownian Lion and the Man. We started this paper with the Lion and the Man and we close it with a 
simple remark on it. Assume we have a Riemannian manifold M satisfying the condition (6.1) Theorem 9. Then, 
given a Brownian Lion running on M, Theorem 9 assures that there is a strategy for the Brownian Man which 
keeps him at the safe positive distance from the Lion. 

In addition, if the Ricci is non-negative, then the Brownian Man can choose a strategy which keeps him at 
fixed distance from the Brownian Lion. This must be particularly frustrating for the Lion especially if they start 
relatively close to each other. 

The very same Theorem shows that if the Ricci curvature is bounded below by a positive constant, then given a 
Brownian Man, the Brownian Lion has a strategy which will bring him arbitrarily close to its meal. 
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